SRC  TR  86-44 


Information  Theoretic  Analysis 
For  A  General  Queueing  System  At 
Equilibrium  With  Application  To 
Queues  In  Tandem 

by 

J.  Cantor,  A.  Ephremides  and 
D.  Horton 


Report  Documentation  Page 

Form  Approved 

OMB  No.  0704-0188 

Public  reporting  burden  for  the  collection  of  information  is  estimated  to  average  1  hour  per  response,  including  the  time  for  reviewing  instructions,  searching  existing  data  sources,  gathering  and 
maintaining  the  data  needed,  and  completing  and  reviewing  the  collection  of  information.  Send  comments  regarding  this  burden  estimate  or  any  other  aspect  of  this  collection  of  information, 
including  suggestions  for  reducing  this  burden,  to  Washington  Headquarters  Services,  Directorate  for  Information  Operations  and  Reports,  1215  Jefferson  Davis  Highway,  Suite  1204,  Arlington 

VA  22202-4302.  Respondents  should  be  aware  that  notwithstanding  any  other  provision  of  law,  no  person  shall  be  subject  to  a  penalty  for  failing  to  comply  with  a  collection  of  information  if  it 
does  not  display  a  currently  valid  OMB  control  number. 

1.  REPORT  DATE 

1986 

2.  REPORT  TYPE 

3.  DATES  COVERED 

00-00-1986  to  00-00-1986 

4.  TITLE  AND  SUBTITLE 

5a.  CONTRACT  NUMBER 

Information  Theoretic  Analysis  for  a  General  Queueing  System  at 

5b.  GRANT  NUMBER 

rAjuinui  min  wiui  /ippiitiuiuu  iu  queues  in  imiuciu 

5c.  PROGRAM  ELEMENT  NUMBER 

6.  AUTHOR(S) 

5d.  PROJECT  NUMBER 

5e.  TASK  NUMBER 

5f.  WORK  UNIT  NUMBER 

7.  PERFORMING  ORGANIZATION  NAME(S)  AND  ADDRESS(ES) 

University  of  Maryland, Electrical  Engineering  Department, College 

Park, MD, 20742 

8.  PERFORMING  ORGANIZATION 

REPORT  NUMBER 

9.  SPONSORING/MONITORING  AGENCY  NAME(S)  AND  ADDRESS(ES) 

10.  SPONSOR/MONITOR'S  ACRONYM(S) 

11.  SPONSOR/MONITOR'S  REPORT 
NUMBER(S) 

12.  DISTRIBUTION/AVAILABILITY  STATEMENT 

Approved  for  public  release;  distribution  unlimited 

13.  SUPPLEMENTARY  NOTES 

14.  ABSTRACT 

see  report 

15.  SUBJECT  TERMS 

16.  SECURITY  CLASSIFICATION  OF: 

17.  LIMITATION  OF 
ABSTRACT 

18.  NUMBER 
OF  PAGES 

38 

19a.  NAME  OF 
RESPONSIBLE  PERSON 

a.  REPORT 

unclassified 

b.  ABSTRACT 

unclassified 

c.  THIS  PAGE 

unclassified 

Standard  Form  298  (Rev.  8-98) 

Prescribed  by  ANSI  Std  Z39-18 


INFORMATION  THEORETIC  ANALYSIS  FOR  A  GENERAL  QUEUEING 


SYSTEM  AT  EQUILIBRIUM  WITH  APPLICATION  TO  QUEUES 

IN  TANDEM 


by 

J.  Cantor 

Business  and  Technological  Systems,  Inc. 
10210  Greenbelt  Road 
Suite  440 

Seabrook,  Maryland  20706 
A.  Ephremides 

Electrical  Engineering  Department 
University  of  Maryland 
College  Park,  Maryland  20742 


D.  Horton 

Formerly  of:  Westinghouse  Defense  &  Electronics  Center 
Baltimore,  Maryland 
Now  with:  Martin  Marietta  Aerospace 
Orlando,  Florida  32822 


INFORMATION  THEORETIC  ANALYSIS  FOR  A  GENERAL  QUEUEING  SYSTEM  AT 
EQUILIBRIUM  WITH  APPLICATION  TO  QUEUES  IN  TANDEM* 

ABSTRACT 

In  this  paper,  information  theoretic  inference  metholology  for  system 
modeling  is  applied  to  estimate  the  probability  distribution  for  the  number  of 
customers  in  a  general,  single  server  queueing  system  with  infinite  capacity 
utilized  by  an  infinite  customer  population.  Limited  to  knowledge  of  only  the 
mean  number  of  customers  and  system  equilibrium,  entropy  maximization  is  used  to 
obtain  an  approximation  for  the  number  of  customers  in  the  G|G|l  queue.  This 
maximum  entropy  approximation  is  exact  for  the  case  of  G  »  M,  l.e.,  the  M|M|l 
queue.  Subject  to  both  independent  and  dependent  information,  an  estimate  for 
the  joint  customer  distribution  for  queueing  systems  in  tandem  is  presented. 
Based  on  the  simulation  of  two  queues  in  tandem,  numerical  comparisons  of  the 
joint  maximum  entropy  distribution  is  given.  These  results  serve  to  establish 
the  validity  of  the  inference  technique  and  as  an  introduction  to  information 
theoretic  approximation  to  queueing  networks. 


*This  work  was  supported  under  a  Naval  Research  Laboratory  Fellowship  under 
Grant  N00014-83G-0203  and  under  an  ONR  Grant  N00014-84K-0614. 


I.  INTRODUCTION 


Classical  queueing  theory  [1,2,3]  has  proven  to  be  quite  successful  in 
modeling  both  communication  networks  [4,5]  and  computer  systems  [7].  In  most 
cases,  rather  unrealistic  assumptions  about  either  the  underlying  arrival 
process  (e.g.,  Poisson)  or  service  distribution  (e.g. ,  negative  exponential) 
must  be  employed  to  obtain  results  as  it  is  under  these  assumptions  that 
queueing  theory  most  easily  yields  solutions.  Because  the  processes  of  "real 
world"  systems  generally  do  not  obey  this  exponential  structure,  the  success  of 
the  models  has  remained  in  doubt. 

Shore  [8]  has  proposed  an  explanation  based  on  information  theoretical 
system  modeling  and  the  princiiple  of  maximum  entropy  or,  more  generally,  the 
principle  of  relative-entropy  [9].  In  Shore’s  approach,  an  abstract  system 
model  consisting  of  various  "states"  is  introduced.  The  probability  of  the 
occurrence  of  a  certain  state  is  estimated  by  the  maximum  entropy  distribution 
subject  to  known  information  in  the  form  of  expected  values  of  functions  of  the 
states.  A  relationship  between  the  abstract  system  and  the  actual  system  is 
thereby  established  and  estimates  of  desirable  probability  distributions  of  the 
model  are  obtained.  Shore  utilizes  this  technique  to  derive  both  the  equili¬ 
brium  and  time-dependent  probability  distribution  for  the  nimber  of  customers 
(number  of  jobs,  number  of  calls,  etc.)  in  M|M|«»|  |N  and  M(m|«  queueing  systems. 

In  this  paper,  information  theoretic  system  modeling  is  applied  to 
estimate  the  probability  distribution  for  the  number  of  customers  in  a  general, 
G|G|l,  queueing  system.  The  extension  to  queueing  networks  is  considered 
through  the  approximation  of  N  systems  in  tandem. 

Bene£  [10],  it  appears,  first  proposed  use  of  maximum  entropy  and  statisti¬ 
cal  mechanical  analysis  of  large-scale  communication  systems.  For  a  telephone 
system  in  which  only  the  expected  number  of  calls  in  progress  is  known,  Benes 
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shows  that  the  maximum  entropy  distribution  is  precisely  that  which  is  obtained 
as  the  equilibrium  distribution  of  an  ergodic,  reversible  birth-death  Markov 
process  with  constant  birth  and  death  rates.  A  direct  implication  of  this 
result  is  that  constrained  only  by  the  expected  number  of  customers  in  the 
system,  the  maximum  entropy  approximation  to  a  single  service  queueing  system  is 
the  celebrated  M|M|l  queue.  The  maximum  entropy  distribution  is,  of  course, 
obtained  independently  of  the  underlying  stochastic  processes  and  under 
moderately  few  technical  assumptions  [9}. 

Ferdinand  [11]  uses  the  principles  of  statistical  mechanics  to  derive  the 
soution  to  the  M]M|ljN  (finite  capacity)  queue.  In  later  work,  Shore  [12] 
establishes  maximum  entropy  (termed  information  theoretic)  approximations  for  a 
number  of  "performance  distributions"  of  M  |G  1 1  and  G|G{l  queues  at  equilibrium. 
These  performance  distributions,  such  as  the  number  of  customers  in  the  system, 
a  customer's  waiting  time,  or  the  number  served  in  a  busy  period,  are  estimated 
subject  to  moments  of  the  interarrival  and  service  time  distributions.  It  is 
demonstrated  that,  using  relatively  few  moments,  that  maximum  entropy  provides 
good  approximations  to  a  variety  of  M|G|l  systems.  Further,  for  many  of  the 
distributions,  the  approximations  yield  exact  results  when  G  ■  M.  Using  an 
approach  similar  to  that  of  Shore,  El-Affendi  and  Kouvatsos  [13]  independently 
establish  a  maximum  entropy  approximation  to  the  number  of  customers  in  a  M|G|l 
system  as  well  as  the  service  distribution  corresponding  to  the  estimate. 
Further,  an  approximation  to  a  specific  G j M j 1  queue  is  determined. 

In  Section  II  of  this  paper,  the  general  relative  entropy  formalism  and 
technique  for  solution  are  presented,  and  the  specialized  case  of  entropy  maxim¬ 
ization  is  discussed.  These  techniques  are  applied  in  Section  III  to  derive  the 
maximum  entropy  approximations  to  the  G|g|1  queueing  system.  In  Section  IV, 
utilizing  the  GjGjl  approximation,  the  maximum  entropy  approximation  to  the 
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joint  distribution  of  the  number  of  customers  in  N  queues  in  tandem  is  estab¬ 
lished.  A  discussion  of  maximum  entropy  "product  form"  solutions  subject  to 
information  on  the  marginal  and  joint  distribution  is  included.  In  Section  V, 
numerical  comparisons  of  the  maximum  entropy  approximation  for  known  or  simu¬ 
lated  distributions  for  two  queues  in  tandem  are  given.  The  paper  is  concluded 
by  a  general  discussion  in  Section  VI. 

H.  PROBLEM  STATEMENT  AND  RELATIVE-ENTROPY  MINIMIZATION 
Consider  a  system  that  has  a  countable  set  S  of  possible  states  with 

p(Si)>  0,  SieS  ,  i  -  1,2,...  .  (1) 

I  P(S  )  -  1  (2) 

i 

where  p(S^)  is  the  probability  of  the  occurrence  of  the  state  S^.  Assume  that 
there  exists  a  "true"  distribution,  q+eD  which  is  unknown.  It  is  desirable  to 
estimate  this  distribution  q  based  on  incomplete  information.  Let  p  be  the 
current  estimate  or  initial  value  distribution  of  q+. 

Suppose  new  information  about  q+  becomes  available  in  the  form  of  expected 
values  of  known  functions,  f gi  l  -  1,2, ...,M,  of  the  states  as  follows: 

I  <l+(S1)fA(Si)  “  <f*>  •  (3) 

The  constraints  (l)-(3)  do  not  precisely  identify  q+.  Indeed,  besides  the  true 
distribution,  q+,  there  exists  a  subset  of  distributions  D*  of  D  which  also 
satisfies  all  constraints.  One  way  of  uniquely  choosing  an  estimate  for  q+, 
well-accepted  in  the  literature  [14,15,9],  is  the  method  of  minimizing  the  rela¬ 
tive  entropy  (also  known  as  cross-entropy,  Kullback-Leiber  number,  directed 
divergence,  or  discrimination  information),  namely,  choose  qeD  so  that  H[q,p] 
defined  by 
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(4) 


.  q(s.) 

Hlq,p]  -  l  q(S1)log(~(rs-- y) 


Is  minimized.  The  choice  of  q  as  above  is  called  the  final  value  distribution. 
For  estimating  probability  distributions,  relative-entropy  minimization  has  been 
shown  to  be  self-consistent  and  uniquely  correct  19],  and,  therefore,  the 
estimates  are  sometimes  called  information  theoretic  approximations.  It  turns 
out  [15,16,17]  that  if  there  exists  a  solution  to  (4)  such  that  the  constraints 
(l)-(3)  are  satisfied,  then  that  solution  has  the  form 

M 

q(si>-  p(Si)exp(-6o  -  J  (5) 

£-  1 


at  all  states,  except  possibly  on  a  set  for  which  q  is  identically  zero  [16]. 

In  (5),  0.,  £  ■  0,1 . M  are  Lagrangian  multipliers.  Further  if  0  can  be 

*  * 

determined  such  that  the  constraints  (l)-(3)  are  satisfied,  then  a  solution  to 
(4)  exists*  and  is  given  by  (5).  From  (2)  and  (5),  the  following  partition 
function  exp(0Q)  given  by 

M 

exp(0Q)  -  l  exp(-  l  0  f  (S  ))  (6) 

i  £»1 

or 

M 

0  -  log  l  exp(-  l  0,f  ( S  ))  (7) 

°  ±  £-1  x  *  1 

can  be  defined,  noting  that  0q  is  a  function  of  the  other  multipliers.  If  the 
sum  in  (7)  converges  and,  in  particular,  assumes  a  closed  form  solution,  then 
the  multipliers  may  be  determined  via  the  following  relations: 


<v 


(8) 


*The  issue  of  existence  of  a  solution  has  been  studied  by  Cslsz&r  [16]. 
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In  the  event  that  no  closed  form  is  obtainable  or  the  system  (8)  Is  inconsis¬ 
tent,  the  multipliers  must  be  approximated  by  numerical  techniques  [18].  For 
systems  that  have  an  uncountable  number  of  system  states,  the  sums  are  replaced 
by  integrals  in  the  usual  way. 

Entropy  Maximization 

When  little  is  known  a  priori  about  the  distribution  to  be  estimated,  a 
natural  choice  for  the  initial  value  distribution  is  one  in  which  all  states  are 
given  equal  weight,  i.e.,  the  uniform  on  S.  In  this  case,  the  solution  q  in  (4) 
is  said  to  maximize  the  entropy  H  given  by 

H[q]  -  -  l  q(S .)log(q(S  ))  .  (9) 

i 

When  a  solution  exists,  q  is  denoted  the  maximum  entropy  approximation  to  q+ 
and  is  given  by 

M 

q( s^)  -  exp(-so  -  l  )  .  (10) 

i®  1 

Note  that  (10)  is  identical  to  (5)  with  the  initial  value  distribution  deleted. 

A  large  value  of  the  entropy  functional  (9)  corresponds  to  a  high  degree  of 
uncertainty.  The  maximum  entropy  distribution  can  thus  be  interpreted  as  the 
probability  distribution  that  reflects  maximum  uncertainty  while  utilizing  all 
available  information.  In  this  sense,  it  is  the  "least  biased"  or  "most  conser¬ 
vative"  distribution  one  can  propose  which  satisfies  all  constraints. 

In  the  following  section,  the  methodology  of  relative-entropy  minimization 
is  applied  to  obtain  approximations  to  the  number  of  customers  in  a  G ) G  j 1  system 
at  equilibrium. 
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III.  MAXIMUM  ENTROPY  APPROXIMATION  TO  GENERAL  QUEUEING  SYSTEMS 


Consider  a  queueing  system  consisting  of  an  Infinite  customer  population 
from  which  Individual  customers  arrive  singly  with  Interarrival  times 
Identically  distributed  according  to  a  general  distribution  A(t).  The  customers 
(possibly)  wait  in  an  infinite  capacity  buffer,  then  are  served  individually  by 
a  server  according  to  a  general  service  time  distribution  B(t),  and  finally 
return  to  the  arriving  portion  of  the  customer  population.  The  queueing  system 
is  assumed  to  be  at  equilibrium  with  the  steady  state  distribution  to  the  number 
of  customers  in  the  system  denoted  by  q+. 

Available  information  in  a  queueing  system  is  often  in  the  form  of  moments 
of  the  interarrival  distribution 

a  -  /tmdA(t)  m  *  1,2,... 

m 

and  the  service  time  distribution 

s  *  ft  dB (t )  ®  *  1*2» •••  • 

m 

The  ratio  of  the  first  moments,  p  «  s^/a^,  *s  denoted  the  utilization  factor, 
and  it  determines  the  traffic  intensity  or  "loading"  on  the  system  [2]. 

Based  on  the  expected  number  of  customers  in  the  system,  <K>,  the  maximum 
entropy  approximation  to  q+(K)  can  be  obtained  subject  to 

m 

l  q(K  -  k)  -  1  (11) 

k-0 

and 

OP 

l  kq(K  -  k)  -  <K>  .  (12) 

k-0 

From  (6)-(8)  and  (10),  it  follows  that  (see  [12]  for  details) 
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q(K  -  k) 


1 


(13) 


<K> 


1  +  <K>  '1  +  <K> ' 


The  distribution,  q(K),  has  nothing  specifically  to  do  with  a  queueing  system 
but  is  the  maximum  entropy  distribution  obtained  from  a  single  moment  constraint 
and  normalization  and  (13)  is  applicable  to  any  system  whether  or  not  the  system 
is  at  equilibrium  when  only  the  mean  of  the  distribution  to  be  estimated  is 
available.  What  is  of  interest  here  is  that  (13)  is  the  formula  to  the  steady 
state  distribution  of  the  M]M|l  queueing  system  [2].  This  can  be  seen  by  noting 
that  for  the  M|M|l  queue,  q+(K),  is  given  by 


q+(K  -  k)  -  (1  -  p)(p)k 


(14) 


and 


<K> 


(15) 


so,  putting  (15)  into  (14),  (13)  follows.  The  implication  of  the  result  is  that 
the  maximum  entropy  approximation  to  a  G|G|l  system  subject  to  only  the  mean 
number  of  customers  is  the  M jM j 1  queue.  It  turns  out  that  this  approximation  is 
sometimes  a  satisfactory  estimate  for  the  M {G ( 1  system  [12],  especially  when  the 
service  distribution  is  close  to  exponential,  e.g.,  the  M  jH^ 1 1  system,  but  as  a 
general  G|g|1  approximation,  there  is  no  reason  to  assume  (13)  will  be  "close." 

To  specify  the  queueing  system,  we  impose  additional  constraints.  Namely, 
as  is  well-known  [1,2],  all  single  server  systems  at  steady  state  satisfy  the 
equilibrium  condition 

■  _L  '  (16) 

si  ai 

which  has  the  interpretation  that  the  average  rate  of  arrivals  to  the  system  is 
equal  to  the  average  rate  of  departures.  Now,  subject  to  (11)-(12)  and 
rewriting  (16)  as 
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q ( 0)  -  1  -  p 


(17) 


where  p  ■  0j/aj,the  following  queueing  system  approximation  can  be  considered. 
The  constraint  (17)  can  be  posed  as 
00 

l  I(k)q(K  -  k)  -  1  -  p 
k-0 

where 

(1  if  k  -  0 

I(k)  -  )  .  (18) 

(0  if  k  >  0 

The  maximum  entropy  solution  has  the  form 

q(K  *  k)  «  exp(-0Q  -  gjk  -  82I(k))  .  (19) 

For  notational  convenience,  pose 

x:  -  exp(-gg)  y:  -  exp(-$1)  z:  -  exp(-B2) 

and  observe  from  (19)  that 

'  xz  k-0 

q(K  -  k)  -  \  .  .  (20) 

(  xy  k  >  0 

From  (11),  (12),  (17),  and  straightforward  manipulations,  we  obtain: 


8 


Substituting  the  values  for  x,y,z  in  (20),  the  approximation  to  the  G|G|l 
queueing  system  is  given  by 


q(K  -  k)  - 


1  -  p 


k  -  0 


<K>  -  p.k 
<K>  ' 


k  >  0 


(21) 


For  the  particular  case  of  p  ■  <K>/(1  +  <K>),  (21)  reduces  to  q(K  ■  k)  ■ 

I- 

(1  -  p)(p)  the  exact  m|m|1  formula  as  expected.  It  is  noteworthy  that  in  [12] 
(17)  is  applied  not  as  a  constraint  but  as  a  condition  on  the  solution,  (13). 
This  results  in  a  G|G|l  approximation  given  by  (14)  which  is  clearly  less 
general  than  (21). 

In  the  following  section,  utilizing  the  approach  which  led  to  the  G|G|l 
approximation,  maximum  entropy  approximations  to  the  joint  distribution  of  the 
number  of  customers  in  more  than  one  queueing  system  is  examined  and  the  approx¬ 
imation  to  queues  in  tandem  is  presented. 


IV.  MAXIMUM  ENTROPY  APPROXIMATION  TO  QUEUEING  SYSTEMS  IN  TANDEM 

Under  certain  conditions,  the  relative-entropy  joint  distribution,  or, 
specifically,  the  maximum  entropy  distribution,  is  equal  to  the  product  of 
relative-entropy  distributions.  The  concept  of  product  form  distribution  is 
stated  precisely  below. 

Definition:  A  joint  probability  distribution  q+(Kj . . . ,KN)  is  of  the 
product  form  if 

+  N 
q  (Kj.Kj,...,^)  -  n  q+a±)  , 

i«l 

i.e.,  the  joint  distributrion  is  equal  to  the  product  of  the  marginal  distribu¬ 
tions  q+(K^),  1  <  i  <  N. 
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The  above  definition  is,  of  course,  equivalent  to  the  random  variables 

being  statistically  independent.  Much  of  this  section  will  be 
devoted  to  the  discussion  of  when  the  approxlation -is  of  the  product  form. 


Maximum  Entropy  Approximation  to  N  General  Queueing  Systems. 

Subject  to  constraints  only  on  the  mean  number  of  customers  in  each  system 
and  equilibrium,  it  is  now  shown  that  the  maximum  entropy  approximation  to  the 
joint  distribution  of  the  number  of  customers  in  N  queueing  systems  is  a 
"product  form"  solution.  The  equilibrium  condition  for  each  system  is  as 
before;  the  average  arrival  rate  is  equal  to  the  average  departure  rate.  For 
notational  simplicity,  let 


<K^>:  *  the  expected  number  of  customers  in  system  i 


and 


s 


(i) 
1 


p  :  ■  -/jT  "  the  utilization  factor  for  system  i 
al 


where  s^^  and  a^  are  the  first  interarrival  and  service  moments  of  system  i. 
Under  the  above  assumptions  on  the  constraints  of  the  joint  distribution,  the 
following  result  can  be  shown. 

Proposition  1;  The  maximum  entropy  approximation  to  the  distribution  of  the 
number  of  customers  in  N  G|Gjl  queueing  systems  at  equilibrium  is  of  the  product 
form,  i.e., 


N 


q(K.,K  ,...,KJ  -  H  q(K  ) 
1  ”  i*l  1 


(22) 


where 


1  “  Pa 


q(Ri  ■  1^) 


<V  -  <>i 


,.<V  -  Pi.k 

V  (  /If  \  ) 


-  0 

k±  >  0 


(23) 


<Ki> 
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Proof : 


Because  there  are  constraints  only  on  the  marginal  distributions,  the  Joint 
maximum  entropy  distribution  has  the  form 

2N 

q(K1,K2,...,KN)  -  exp(-B0  -  I  *ifi(Kl’K2’  —  ’V } 

where,  by  assumption,  f^(K^ ,K2» • • . ,K^)  is  a  function  of  only  K^.  Specifically, 

2N  N 

exp(-Bo  -  l  Bifi(K1,K2,...,KN))  -  exp(-Bo)  H  expC-B^  -  X  I(K  ) 
i-1  i*l 

by  simply  relabeling  the  multipliers  02i  »  I(K^ )  is  defined  by  (18).  Under 

the  assumption  that  the  marginal  distributions  satisfy  the  normalization 
constraint,  the  product  form  follows.  By  (22),  it  is  clear  that  the  maximum 
entropy  approximation  to  each  marginal  distribution  is  exactly  the  G|G|l  approx¬ 
imation  given  by  (21)  and  (23)  is  obtained.  Q.E.D. 

It  should  be  emphasized  that  Proposition  1  is  not  a  result  on  queueing  net¬ 
works.  A  queueing  network  is  an  inter-connected  group  of  queueing  systems  where 
customers  can  enter  an  individual  "node"  (queueing  system),  extract  service,  and 
then  either  depart  the  network  completely  or  go  to  another  node  and  extract  more 
service.  In  order  to  discuss  approximating  distributions  to  such  systems,  many 
new  parameters  must  be  considered,  such  as  the  network  topology  or  the  possible 
transitions  between  systems.  To  utilize  information  theoretic  analysis,  it  is 
necessary  to  capture  the  system  interaction  through  appropriate  equilibrium 
constraints.  In  the  certain  instances  where  one  has  knowledge  of  rate  balance 
equations  for  each  individual  node  of  a  network  as  well  as  system  equilibrium 
rate  balance  equations,  one  possible  approach  follows  from  relative-entropy 
minimization  subject  to  fully  decomposable  subset  and  aggregate  constraints  [19]. 
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This  technique  is  not  considered  here;  instead,  it  turns  out  that  the  approach 
utilized  for  the  single  queueing  system  is  applicable  to  an  elementary  queueing 
network,  namely,  queueing  systems  in  tandem. 


Maximum  Entropy  Approximation  to  Tandem  Queues  at  Equilibrium. 


A  simple  queueing  network  topology  is  N  queues  in  tandem.  The  tandem 
queueing  network  of  consideration  consists  of  N  queueing  systems  in  which 
customers  departing  node  i  immediately  enters  node  i+1.  It  is  assumed  that 
once  a  customer  enters  the  first  node,  he  must  extract  service  at  each  node 
before  departing  the  network  and  reentering  the  arriving  customer  population. 
The  arrival  rate  for  node  1+1  is  therefore  the  departure  rate  for  node  i.  It 
is  further  assumed  that  the  system  as  a  whole  is  at  equilibrium,  so,  by  the 
topology,  it  follows  that 


1  -  q(K  -  0) 

m - "TT5 — 


F7 


Ji+l) 

al 


i.e.,  the  average  arrival  rate  to  all  nodes  is  identical  and  therefore  must  be 
equal  to  the  arrival  rate  to  the  first  node;  1/a^.  Hence,  subject  to  (24), 
equilibrium,  and  the  expected  number  of  customers  in  each  node,  (22)-(23),  is 
the  maximum  entropy  approximation  to  the  distribution  of  the  number  of  customers 
in  a  network  of  tandem  queues  where  has  the  particular  form 

pi  ‘“nr- 

a. 


Although  the  interconnection  of  the  tandem  queues  can  be  addressed  by  (24), 
the  form  of  the  approximation  (22)  is  generally  incorrect.  The  only  known  tan¬ 
dem  network  with  a  product  form  distribution  for  the  number  of  customers  in  the 
network  is  one  with  Poisson  arrivals  to  the  first  node  and  where  customer  ser¬ 
vice  times  in  each  node  are  distributed  according  to  an  exponential  distribution. 
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Moreover,  the  service  times  for  a  specific  customer  are  independent  from  node  to 


node.  The  product  form  follows  directly  from  Burke's  Theorem  [2],  and  it  turns 
out  that  each  node  is  an  M|M|l  queue  [2].  Thus,  in  order  to  obtain  a  maximum 
entropy  approximation  to  the  tandem  network  which  is  not  of  the  product  form,  it 
1 8  necessary  to  repose  the  inference  problem  and  obtain  the  approximation  sub-* 
ject  to  "dependent"  information.  This  notion  of  dependent  information  is  now 
made  precise. 

Maximum  Entropy  Approximation  Subject  to  Joint  Information. 

Subject  to  information  only  on  the  marginal  distribution,  the  maximum 
entropy  approximation  to  a  joint  probability  distribution  is  of  the  product  form 
[17].  Conversely,  if  there  is  any  hope  of  capturing  the  possible  dependence  of 
random  variables,  it  is  necessary  to  use  constraints  which  give  information  on 
two  or  more  random  variables  simultaneously,  l.e., 

_  E  .  f(Kl'K2 . V"<K1-K2 . V  ’  <£<VK2 . V>  • 

2' 

Constraints  of  this  form  are  on  the  joint  distribution  and  will  be  termed  joint 
constraints.  A  special  case  of  interest  occurs  when  the  function  of  the  random 
variables,  f(Kj,K2 . K^)  is  of  the  form 

f(Kj,K2 . Kjj)  -  K*  ...  kJJ 

where  i,  j,  and  k  are  integers,  then  the  joint  constraint  is  a  joint  moment  of 
the  true  distribution. 

There  is  nothing  in  the  maximum  entropy  formalism  which  disallows  obtaining 
the  maximum  entropy  solution  subject  to  constraints  only  on  the  joint  distribu¬ 
tion.  Under  the  assumption  that  the  problem  is  well-defined.  It  turns  out  that 
all  marginal  distributions  are  identical.  It  appears,  however,  that  in  general. 
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nothing  can  be  stated  about  whether  the  approximation  to  the  joint  distribution 
is  of  the  product  form. 

The  relative-entropy  approximation  subject  to  constraints  on  both  the 
marginal  and  joint  distribution  can  still  be  of  the  product  form.  Under  the 
assumption  that  the  Initial  value  distribution  is  of  the  product  form,  necessary 
and  sufficient  conditions  are  presented  for  the  special  case  of  moment 
constraints.  The  proof  of  the  following  proposition  is  given  in  the  Appendix. 

Proposition  2:  The  relative-entropy  approximation  to  q+(Kj .K^, •• . subject 
to  both  marginal  and  joint  moment  constraints  is  of  the  product  form  if  and  only 
if  each  joint  constraint  is  equal  to  the  product  of  marginal  constraints 
satisfied  by  the  approximation. 

If  the  random  variables  are  correlated,  then  joint  constraints  can  be 
used  to  force  probabilistic  dependence  of  the  approximation.  In  the  present 
application  where  the  random  variables  represent  quantities  of  individual 
systems,  joint  constraints  thereby  provide  a  means  for  the  modeling  and  eventual 
analysis  of  system  interaction. 

As  a  special  application  of  the  preceding  result,  consider  the  maximum 
entropy  approximation  to  the  joint  distribution  q+(Kj,K2)  subject  to  normaliza¬ 
tion  and  the  covariance  of  the  random  variables  Kj  and  K^,  i.e., 

CovCKjK^:  -  l  (Kx  -  <K1»(K2  -  OC^qCK^) 

K1,K2 

This  is  not  a  valid  constraint  as  the  covariance  cannot  be  written  as  the 
expected  value  of  a  single  function  of  the  random  variables.  However,  if  the 
problem  is  posed  as  the  determination  of  the  maximum  entropy  distribution  sub¬ 
ject  to  <Kj>,  <K2>,  and  <KjK2>,  the  problem  is  generally  well-defined  and  the 
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solution  will  automatically  satisfy  the  covariance,  Cov(KjK2).  Noting  that 
CovO^K^  -  <K1K2>  -  <K1><K2> 

and  under  the  assumption  of  zero  covariance,  the  joint  constraint  is  equal  to 
the  product  of  marginal  constraints.  Hence,  subject  to  constraints  only  on  the 
means  and  the  first  joint  moment,  the  maximum  entropy  approximation  is  of  the 
product  form  if  and  only  if  the  random  variables  are  uncorrelated. 


Maximum  Entropy  Approximation  to  Two  Queues  in  Tandem. 

Using  the  preceding  ideas,  the  maximum  entropy  approximation  to  the  number 
of  customers  in  a  two-node  tandem  queueing  network  subject  to  both  marginal  and 
joint  constraints  is  now  presented.  Two  separate  approximations  to  q^K^.K^) 
are  considered. 

The  first  approximation  is  obtained  subject  to  normalization,  and  the 
independent  information:  <Kj>,  p^,  and  <K2>,  p2  where  p^  m  s ^ ^ ^ ^ 
already  noted,  the  maximum  entropy  solution  is  a  special  case  of  Proposition  1. 
Let  q^(Kj,K2>  denote  this  approximation  which  is  given  by 

qjCKj.Kj)  -  qjCK^q^K^ 

where 


1  ” 


q^i  -  kA) 


I  V<v  -  Pi 


<V  "  pi  k, 

■><  ■  <Kt>— >* 


ki  ■ 0 
k±  >  0 


i  -  1,2 


(25) 


The  second  approximation  is  obtained  subject  to  the  same  constraints  on 

q^CKj,!^)  and  the  first  joint  moment,  <KjK2>.  The  form  of  the  approximation 
denoted,  q^Kj,^),  18  given  by 


q2(KrK2)  "  exp(~$0  ■  eiKl  ■  B2I(V  "  e3K2  "  B4I(K2)  ” 


(26) 
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In  this  case,  no  analytic  form  could  be  found  and,  for  the  forthcoming  numerical 
results,  (26)  will  be  estimated  by  the  APL  function  of  Johnson  [18],  which  com¬ 
putes  estimates  of  relative-entropy  approximations  given  an  arbitrary  constraint 
matrix.  These  approximations  will  be  compared  in  the  following  section  for  a 
variety  of  two-node  queueing  networks. 


V.  NUMERICAL  RESULTS 

As  far  as  can  be  determined,  there  are  essentially  no  known  distributions 
for  the  number  of  customers  in  a  network  of  tandem  queues  except  for  a  tandem 
network  of  M|m)1  queues.  In  order  to  compare  the  approxiations,  a  simulation  of 
two  queues  in  tandem  is  performed  using  an  APL  function  which  computes  the  joint 

g 

distribution.  The  simulated  distribution  will  be  denoted  q  (K^.K^). 

To  verify  the  accuracy  of  the  simulation,  a  chi-square  goodness  of  fit  test 
[20]  was  performed  at  the  five  percent  level  for  the  M |M jl  M 1 1  system.  After 

performing  the  test,  it  was  determined  that  the  null  hypothesis;  the  distribu¬ 
tion  observed  is  the  joint  distribution  for  the  number  of  customers  is  a 
Jacksonian  tandem  network  [2]  of  M|M|l  queues  could  not  be  rejected. 

Two  different  networks  were  simulated,  one  in  which  customers  require 
service  according  to  the  same  general  distribution  in  each  node  but  have 
independent  service  times  and  one  in  which  a  customer  has  the  identical  service 
time  in  both  nodes.  One  reason  for  considering  these  networks  is  that  for  the 
simulation  distributions,  CovCKj,^)  was  generally  larger  than  for  systems  with 
independent,  Identically  distributed  service  times.  In  both  networks,  customers 
arrive  to  the  first  node  according  to  a  Poisson  process,  and  the  first  node  is 
therefore  an  M|G|l  queue.  A  comprehensive  analysis  of  the  network  where 
customers  have  identical  service  times  in  both  nodes  under  the  assumption  of 
Poisson  arrivals  to  the  first  queue  is  given  in  Boxma  [21],  the  particular  case 
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where  the  first  node  is  an  M|M|l  queue  is  considered  in  Pinedo  and  Wolff  [22], 

As  stated,  the  aproximations  ( 25)— ( 26)  are  for  tandem  networks  with 
general  interarrival  and  service  distributions.  The  forthcoming  examples  are 
restricted,  however,  to  systems  with  Poisson  arrivals  to  the  first  system. 

There  are  two  reasons  for  this:  one  is  the  aforementioned  interest  of  such  net¬ 
works;  the  other  reason  is  that  these  results  serve  to  extend  examples  presented 
by  Shore  [12]  for  single  M|G|l  systems.  Moreover,  in  the  first  three  examples, 
the  departure  process  from  the  first  node  is  not  Poisson  and  so  the  second 
system  is  truly  a  G|G|l  queue,  and  thus  the  numerical  results  give  some  insight 
on  maximum  entropy  approximations  to  general  queueing  systems. 

Numerical  results  are  presented  in  terms  of  the  joint  distribution  of  the 
simulated  distribution  and  the  two  maximum  entropy  approximations.  Several 
examples  are  given  based  on  various  service  distributions.  The  statistics 
<K  >,<K2>,  and  are  computed  from  q^Kj.Kj),  the  constraints,  1  -  P  j  and 

1  -  P2  are  derived  from  q+(Kj,K2).  The  approximations  are  compared  via  the 
following  measures:  the  relative  entropy  between  q^(K^,K2)  and  q  (K^,K2) 


Kj  ,K2 


qi (K1 *K2)log 


qs(K1,K1) 


i  “  1,2 


the  sum  of  the  square  of  the  pointwise  difference  between  q^Kj,!^)  and 

q^Kj.Kj), 

l  (q.(K  ,K  )  -  qS(K  ,K  ))2  i  -  1.2 

K1’K2 

g 

and  the  maximum  absolute  pointwise  difference  between  q^(Kj,K2)  and  q  (K^,K2) 


max  |q.(K  ,K  )  -qS(K  ,K  )|  i  -  1,2 

(KlfK2) 
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A  “small"  value  for  each  measure  Implies  the  distributions  are  "dose."  In  the 

case  where  q^Kj  “  kl,K2  “  is  c*08e  t0  q8(Ri  “  ^1**2  "  ^2^  *or  eacl*  pair 
kj,k2»  the  relative-entropy,  H[q^,q+],  can  be  considered  the  average  percentage 
of  relative  difference  between  the  distributions.  This  interpretation  is  valid, 
for  example,  when  the  maximum  absolute  pointwise  difference  is  a  very  small 
value  and  the  ratio  q^K^.K^/q  (K^,K2)  is  thereby  close  to  one.  It  then 
follows  that 


l  q1(K1,K2)log 


»k2) 

q9(K1,K2) 


(Kj,K2)(- 


q1(Kl,K2) 

q8(K1,K2) 


-  1)  . 


Example  1:  mJh^II  •+•  1 1 

In  this  first  example,  customers  arrive  to  the  first  node  according  to  a 
Poisson  process  with  rate  1/a^^  ■  1,  then  proceed  to  a  service  area  consisting 
of  two  parallel  servers.  The  service  distribution  is  the  hyperexponential 
distribution.  The  first  node  is  therefore  the  M|H2|l  system.  The  service 
density,  b(t),  is  given  by 

b(t)  -  ~  exp(-t)  +  «|exp(-2t)  (27) 


from  which  it  follows  that  s^  ■  5/8.  Hence,  Pj  “  5/8.  By  (24),  the  arrival 
rate  to  the  second  node  is  also  equal  to  one,  even  though  the  departure 
process  from  the  first  node  is  not  Poisson.  The  customers  receive  service  in 
the  second  node  with  density  given  by  (27),  but  each  customer  has  a  service 
time  length  independent  of  his  service  time  in  the  first  system.  Regardless 
of  this  independence  condition,  the  utilization  factor  of  the  second  system  is 
given  by  p2  -  5/8.  The  simulation  was  run  for  5,016  customers,  and  from 
q8(Kj,K2),  the  following  statistics  were  computed:  <K^>  ■  1.77,  <K2>  *  1.83, 

and  <K^K2>  *  3.14.  From  the  Pollaczeck-Khinchin  mean  value  formula  [2]  (cf. 
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page  187),  <Kj>  ■  1.79  when  computed  from  the  true  distribution,  q+(K^),  so  the 
simulation  value  is  extremely  close.  Note  that  Cov(Kj,K2)  *  -0.1;  thus,  the 
simulated  distribution  is  only  slightly  correlated. 

Example  2:  M|H2ll  ♦  l|l 

All  discussion  of  example  1  applies  to  this  second  example,  except  now 
customers  have  identical  service  times  in  each  node.  The  service  density  in 
node  one  is  again  given  by  (27),  so  the  first  system  is  an  M jH^ 1 1  queue  and  the 
network  will  be  denoted  by  MjH^ll  ♦  X )  1  where  "I"  indicates  identical  service 
times.  From  example  1,  it  follows  that  pj  ■  5/8  ■  p2»  The  simulation  was  run 
for  2,536  customers,  and  the  following  statistics  were  computed:  <K^>  ■  1.70  - 
<K2>  and  <KjK2>  -  4.1.  Hence,  Cov(K^,K2)  ■  1.22,  and  customers  having  identical 
service  times  in  each  node  significantly  increases  the  correlation  of  the  random 
variables  and  K2  as  expected. 

Example  3:  M  JD  1 1  1 1 1 

In  the  third  example,  customers  again  arrive  to  the  first  node  according  to 
a  Poisson  process  with  rate  equal  to  one.  Each  customer  then  demands  a  fixed 
amount  of  service  of  length  equal  to  0.5.  The  first  node  is  therefore  an  MjD|l 
system  with  service  time  density  given  by 

b(t)  -  6 (t  -  0.5)  (28) 

where  6  is  the  usual  Dirac  delta  function.  From  (28),  it  is  clear  that 
8 j  -  0.5,  and  by  a  similar  argument  as  in  the  preceding  examples,  th?  utiliza¬ 
tion  for  both  systems  is  equal  to  0.5,  i.e.,  p  ,  ■  0.5  «  p2«  The  simulation  was 
run  for  1,024  customers,  and  the  following  statistics  were  computed:  <K^>  - 
0.765,  <K2>  “  0.512,  and  <K^K2>  ■  0.75.  From  the  Pollaczeck-Khinchin  formula 
<Kj>  -  0.75,  when  computed  for  the  true  distribution,  so,  again,  the  simulation 
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statistic  is  very  close.  For  the  simulation  distribution.  CovCK^.K^)  ■>  0.13, 
so,  as  in  example  1,  the  random  variables  are  essentially  uncorrelated.  The 
small  correlation  likely  results  from  the  light  load  on  the  first  system;  this 
is  only  conjecture,  however,  and  more  investigation  is  necessary  to  establish 
any  connection  between  "loading"  and  correlation. 

Example  4:  M |M 1 1  1 1 1 

In  the  fourth  example,  customers  arrive  to  the  first  node  with  rate  equal 
to  0.8  and  demand  service  according  to  a  negative  exponential  distribution. 
Hence,  the  first  node  is  an  M|m|1  queue.  The  service  density  is  given  by 

b(t)  -  exp(-t) 

so  “  1.  Each  customer  has  identical  service  times  in  each  node,  and  thus 

this  network  differs  from  a  tandem  network  of  M|M|l  queues  discussed  in  connec¬ 
tion  with  Jackson's  and  Burke's  theorems.  The  utilization  factor  for  each  node 
is  given  by  ■  0.8  ■  P2»  The  simulation  was  run  for  4,840  customers,  and  the 
following  statistics  were  computed:  <K^>  ■  3.36,  “  2,83,  and  <KjK^>  ■ 

13.4.  From  the  Pollaczeck-Khinchin  formula,  <K^>  ■  4  when  computed  from  the 
true  distribution.  For  the  simulation  distribution  CovCKj.K^)  *  3.9,  which  is 
the  largest  correlation  among  the  examples. 

For  each  example,  the  results  of  the  comparisons  are  given  in  Tables  1-4. 

Due  to  the  generally  small  probabilities  of  any  given  state,  both  approxi¬ 
mations  q^Kj,!^)  and  q2(K^»K2^  are  "close"  to  the  simulated  distribution  under 
the  measures  of  maximum  pointwise  difference  and  the  sum  of  the  square  of 
pointwise  difference.  Hence,  the  relative-entropy  measure  (average  percentage 
of  relative  difference)  is  the  most  revealing.  By  the  relative-entropy  measure, 
both  approximations  in  examples  one  and  two  differ  from  the  simulation  by  less 
than  one  percent.  The  addition  of  the  joint  constraint  results  in  a  2.4  percent 
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Example  1:  q'(Kx,K2) 
Simulation  Distribution  for 
M|H2|1  -* >H2  Tandem  Network 
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2 

.0536 

.0304 

.0020 

.0132 

.0087 

.0057 

Kl  3 

.0349 

.0198 

.0130 

.0085 

.0056 

.0037 

4 

.0228 

.0128 

.0084 

.0055 

.0036 

.0023 

5 

.0148 

.0083 

.0054 

.0035 

.0023 

.0015 

Example  2:  q*{Kx,K 2) 
Simulation  Distribution  for 
M|H2|1  -»  I|1  Tandem  Network 


K  2 


0 

1 

2 

3 

4 

5 

0 

.2020 

.0764 

.0387 

.0249 

.0131 

.0083 

1 

.1020 

.0490 

.0257 

.0209 

.0120 

.0083 

2 

.0418 

.0265 

.0225 

.0173 

.0115 

.0086 

3 

.0199 

.0197 

.0166 

.0141 

.0085 

.0046 

4 

.0095 

.0142 

.0164 

.0122 

.0056 

.0047 

5 

.0031 

.0081 

.0070 

.0038 

.0036 

.0038 

Maximum  Entropy  Distribution  Subject  to  Utilization, 

<I<!>  —  1.70,<AT2>  =  1.70. 

K2 


0 

1 

2 

3 

4 

5 

0 

.1410 

.0855 

.0544 

.0346 

.0220 

.0140 

1 

.0855 

.0520 

.0331 

.0210 

.0134 

.0085 

2 

.0054 

.0331 

.0210 

.0134 

.0085 

.0054 

3 

.0346 

.0210 

.0134 

.0085 

.0054 

.0034 

4 

.0220 

.0134 

.0085 

.0054 

.0034 

.0022 

5 

.0140 

.0085 

.0054 

.0034 

.0022 

.0014 

92(A'i,/C2) 

Maximum  Entropy  Distribution  Subject  to  Utilization, 
<K  i>  =  1.70,</C2>  =  1.70, </C  \Kn>  —  4.1 

K  2 


0 

1 

2 

3 

4 

5 

0 

.1560 

.0925 

.0534 

.0303 

.0178 

.0103 

1 

.0925 

.0569 

.0342' 

.0205 

.0123 

.0074 

0 

.0534 

.0342 

.0214 

.0134 

.0084 

.0052 

3 

.0308 

.0205 

.0134 

.0087 

.0057 

.0037 

4 

.0178 

.0123 

.0084 

.0057 

.0038 

.0026 

5 

.0103 

.0074 

.0052 

.0037 

.0026 

.0018 

Example  3:  q*{Ki,K^ 
Simulation  Distribution  for 
MjD|l  —  I|1  Tandem  Network 


K  2 


0 

1 

2 

3 

4 

5 

0 

.2930 

.1950 

0 

0 

0 

0 

1 

.1530 

.1780 

0 

0 

0 

0 

2 

.0363 

.0916 

0 

0 

0 

0 

3 

.0049 

.0345 

0 

0 

0 

0 

4 

.0008 

.0101 

0 

0 

0 

0 

5 

0 

.0021 

0 

0 

0 

0 

7 

Maximum  Entropy  Distribution  Subject  to  Utilization, 

<I<t>  =  .765,<K2>  =  .512, 

K2 


0 

1 

2 

3 

4 

5 

0 

.2500 

.2440 

.0057 

1.34X10"4 

3.14X10"® 

7.37  X10"8 

1 

.1630 

.1600 

.0037 

8.77  X 10“® 

2.05X10"® 

4.81X10"® 

9 

md 

.0056 

.0055 

.0013 

3.04  X 10~® 

7.12  X10"7 

1.67X10"® 

3 

.0196 

.0019 

.0004 

1.05X10"5 

2.47  X10"7 

5.78  X10"9 

4 

.0067 

.0066 

.0002 

3.64  X  10“® 

8.84  X10"8 

8.54X10"® 

5 

.0023 

.0023 

.0005 

1.26X10"® 

2.96X10"® 

.6930  XI  O'9 

m  qAKvKi) 

Maximum  Entropy  Distribution  Subject  to  Utilization, 

<Kl>  =  .765, < /C 2>  =  512,  <KlKo>  =  .521 


K2 


0 

1 

2 

3 

4 

5 

0 

.2820 

.2150 

.0025 

2.95  X10"5 

3.46  XI 0"7 

4.05  X10"9 

1 

.1580 

.1700 

.0028 

4.94X10"® 

7.96  X10'7 

1.26X10"® 

2 

.0043 

.0065 

.0015 

3.54  X10"6 

8.25  X10"7 

1.92X10"® 

3 

.0118 

.0025 

.0008 

2.70X10"® 

8.98  X10"7 

2.92X10"® 

4 

.0032 

.0096 

.0004 

2.07x10"® 

9.56  X10"7 

4.43X10"® 

5 

.0009 

.0037 

.0002 

1.58  X10"5 

9.56  X10"7 

6.72  X10"8 

Example  4:  ql(Kl,Ki) 
Simulation  Distribution  for 
M|M|1  -*  I|1  Tandem  Network 


K  2 


0 

1 

2 

3 

4 

5 

0 

.0982 

.0382 

.0253 

.0175 

.0099 

.0077 

1 

.0527 

.0318 

.0247 

.0167 

.0125 

.0083 

2 

.0290 

.0242 

.0261 

.0186 

.0152 

.0079 

3 

.0169 

.0189 

.0218 

.0214 

.0152 

.0076 

4 

.0082 

.0114 

.0175 

.0217 

.0107 

.0095 

5 

.0040 

.0072 

.0116 

.0179 

.0090 

.0080 

Maximum  Entropy  Distribution  Subject  to  Utilization, 
<KX>  =  3.36,<A-2>  =  2.83, 

K2 


0 

1 

2 

3 

4 

5 

0 

.0400 

.0451 

.0324 

.0233 

.0167 

.0120 

1 

.0378 

.0427 

.0306 

.0220 

.0158 

.0113 

2 

.0289 

.0326 

.0234 

.0168 

.0121 

.0090 

K 1 

3 

.0221 

.0249 

.0179 

.0128 

.0092 

.0066 

4 

.0169 

.0190 

.0137 

.0098 

.0070 

.0050 

5 

.0129 

.0145 

.0104 

.0075 

.0053 

.0039 

Maximum  Entropy  Distribution  Subject  to  Utilization 

<I<X> 

=  3.36,</Co> 

—  2.83, 

</C1/Co> 

=  13.4 

0 

1 

I<2 

2 

3 

4 

5 

0 

.0467 

.0526 

.0346 

.0227 

.0149 

.0097 

1 

.0443 

.0509 

..0341 

.0228 

.0153 

.0102 

2 

.0315 

.0369 

.0251 

.0172 

.0117 

.0080 

K 1  3 

.0224 

.0267 

.0186 

.0129 

.0090 

.0062 

4 

.0159 

.0193 

.0137 

.0097 

.0070 

.0049 

5 

.0113 

.0140 

.0101 

.0073 

.0052 

.0038 

Table  1:  Comparison  of  Approximations  for  Example  1 
•_  M|H2|1  — »H2  Tandem  Network 


Constraints:  <KX>  =  1.77,  <K2>  —  1.83,  <KXK2>  —  3.14 

■ 

£(gi(Ai,A3)-  «*(Ai,A2))a, 

max  |  ?,  {K  1,/C2)  -  9*(/C1,/C2)  | 

mu 

1.43  XlO"4 

6.20  X10-3 

IB 

1.44  X  10-4 

6.60  XlO-3 

Table  2:  Comparison  of  Approximations  for  Example  2 
M|H2|1  —  I|1  Tandem  Network 


Constraints:  <KX>  —  1.77,  </C2>  =  1.83,  <KXK2>  —  14 

■ 

£(9a^i,/C2)-9*(/C  1,/Cj))*, 

max  |  9;  (/C  j,/C2)  -  9  *  (K  X,K 2)  | 

9l 

1.01  X10'1 

5.76  X10-3 

6.17  X10'2 

?2 

3.60  XlO'3 

4.60  X10'2 

Table  3:  Comparison  of  Approximations  for  Example  3 
M|D|1  — *•  I|1  Tandem  Network 


Constraints:  <KX>  =  1.77,  <K2>  =  1.83,  <KXK2>  —  14 

■ 

max  |  q{  ( K X>K 2)  -  q' (K X,K 2)  | 

KB 

EBB31 

7.05  X10"3 

4.94  X  10~2 

it 

1.53  X10'5 

2.63X10^ 

Table  4:  Comparison  of  Approximations  for  Example  4 
M|M|1  — ►  I|X  Tandem  Network 


Constraints:  <KX>  —  3.36,  <K2>  =2.83,  </C1/C2>  —  13.14 

■ 

mQBH 

m^\qi(Kx,K2)-q'{Kx,K2)\ 

5.56  X10'3 

5.82  X10~2 

KB 

1.91  X10'1 

5.03  X10'3 

5.15  X10’2 
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increase  in  accuracy  in  example  3  and  a  four  percent  increase  in  example  A,  the 
most  correlated  among  the  examples. 

VI.  DISCUSSION 

In  this  paper,  information  theoretic  analysis  was  applied  to  obtain  an 
explicit  maximum  entropy  distribution  for  the  number  of  customers  in  a  G|G|l 
system  to  "minimal"  information.  The  approach  was  then  extended  to  multiple 
simultaneous  G|G|l  systems  where  the  approximation  was  obtained  subject  to 
independent  information  resulting  in  an  independent  (product  form)  distribution. 
The  result  is  of  interest  primarily  because  it  sparked  a  general  discussion  of 
product  form  approximations  where  it  was  determined  that  dependency  of  random 
variables  is  captured  only  by  Imposing  constraints  on  both  the  marginal  and 
joint  distributions.  Moreover,  although  not  directly  applicble,  the  result 
serves  to  stimulate  interest  in  the  use  of  this  inference  technique  for  networks 
of  queueing  systems. 

The  difficulty  in  applying  maximum  entropy  analysis  to  a  network  of  queues 
lies  in  capturing  the  interconnection  of  the  systems.  As  a  first  step,  a  very 
simple  network,  queues  in  tandem,  was  considered.  For  this  network,  the  inter¬ 
connection  of  the  queues  was  captured  by  the  equilibrium  constraint,  which  led 
to  an  explicit  formula  for  the  number  of  customers  in  the  tandem  system  subject 
to  the  mean  number  of  customers  in  each  system.  A  secondary  problem,  that  of 
the  form  of  the  approximation,  was  addressed  by  adding  an  additional  constraint 
on  the  first  joint  moment.  This  result  is  interesting  because  it  identifies 
minimal  information  for  a  joint  system  that  addresses  the  possible  dependency  of 
the  random  variables. 

The  maximum  entropy  approximation  could  prove  useful  in  applications  in 
which  knowledge  of  the  full  probability  distribution  is  necessary.  One  example 
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is  queueing  network  control*  By  strainghtforvard  techniques  (measurement  or 
prediction),  it  is  common  to  know  (or  have  unbiased  estimates  of)  average  flows 
and  average  service  rates  and  thereby  average  queue  sizes.  Higher  order  moments 
such  as  variances  are  needed,  however,  to  apply  modern  distributed  routing 
algorithms  that  utilize  estimates  of  flow  derivatives*  The  more  sophisticated 
versions  of  these  algorithms  require  the  entire  distribution  to  predict  the 
optimal  control  function.  It  should  be  noted  that  the  maximum  entropy  approxi¬ 
mation  has  been  shown  to  be  continuous  as  a  function  of  the  constraints  [23], 
Thus,  if  only  good  estimates  of  moments  are  available  (for  example,  through 
repeated  sampling),  the  approximation  will  be  close  to  the  approximation  subject 
to  the  true  (unknown)  moments. 

The  approximation  could  also  be  used  as  an  additional  tool  in  conventional 
operational  analysis  [24-26].  Indeed,  in  the  present  approach,  standard  perfor¬ 
mance  measures  are  utilized  as  constraints  for  the  distribution.  In  any  case, 
if  the  customer  population  approximation  is  close  to  the  true  distribution  in 
some  precise  sense,  then  it  can  provide  a  more  extensive  analysis  of  system 
interaction.  For  example,  knowledge  of  the  joint  customer  distribution  readily 
provides  the  distribution  for  the  number  of  customers  in  the  network  as  well  as 
marginal  and  conditional  customer  distributions.  From  the  conditional  distribu¬ 
tions,  conditional  moments  can  be  computed  and,  under  appropriate  assumptions, 
estimates  of  conditional  expected  delays  and  sojourn  (response)  times  become 
available. 

Tandem  queues  were  selected  to  introduce  the  approach  to  networks.  Because 
of  the  special  topological  structure,  much  is  known  [21-22,27]  about  such 
systems.  The  focus  here  is  not  the  particular  study  of  tandem  queues  but  an 
initial  consideration  of  the  issues  involving  information  theoretic  analysis  for 
queueing  networks.  It  seems  clear  that  the  approach  applied  to  the  tandem 
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network  can  be  extended  to  more  general  networks  through  imposed  marginal  and 
joint  constraints  along  with  analogous  equilibrium  constraints  that  reflect  the 
topological  structure. 

To  assess  the  quality  of  the  two  maximum  entropy  distributions  as  approxi¬ 
mations  to  the  customer  distribution  of  a  tandem  network,  several  different 
examples  were  presented.  The  particular  case  of  a  two— node  network  was 
considered  where  customers  extracted  service  according  to  a  variety  of  distribu¬ 
tions  and  had  either  identical  or  independent  and  identically  distributed 
service  times  in  each  node.  Based  on  these  examples,  it  appears  that  minimal 
information  results  in  "good"  approximations.  In  particular,  for  small  correla¬ 
tion,  it  appears  that  the  product  form  approximation  is  quite  accurate.  This 
approximation  is  appealing  as  it  has  closed  form  solution.  When,  however,  the 
random  variables  are  strongly  correlated,  the  joint  constraint  should  become 
more  important. 

A  question  that  arises  in  comparing  probability  distributions  is  what 
constitutes  a  good  measure  of  closeness.  For  distributions  such  that  the  prob¬ 
ability  of  any  particular  state  is  small,  for  example,  if  the  mass  is  spread 
"uniformly"  over  a  large  state  space,  then  the  maximum  absolute  pointwise 
difference  of  the  6um  or  the  square  of  the  pointwise  difference  of  the  two 
distributions  is  expected  to  be  small.  In  this  case,  it  appears  that  the 
relative-entropy  between  the  two  distributions  is  the  best  among  the  three  pro¬ 
posed  measures  as  it  relates  the  two  on  a  "micro"  level.  Emphasizing  this 
measure  for  the  tandem  examples,  it  is  reasonable  to  say  that  the  approxima¬ 
tions,  in  particular  those  obtained  with  the  joint  constraint,  are  all  close. 

Of  course,  far  more  analytical  and  nunerical  work  is  needed  to  aay  anything 
precise  about  the  approximations,  but  as  an  inference  technique  for  single 
system  queues  and  possibly  networks  of  queues,  the  results  are  promising. 
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APPENDIX 


Proof  of  Proposition  2, 

For  noCational  simplicity,  let 

K:  -  (K1,K2,...,Kn)  . 

By  system  independence  [17],  if  the  initial  value  distribution  is  of  the  product 
form,  the  relative-entropy  approximation  to  q  (K)  subject  only  to  constraints  on 
the  marginal  distribution,  i.e., 

I  f.(K)q(K)  -  l  f  (K  )q(K)  -  <f  (K  )>  i  -  1,2 . L 

KJ  KJ  J1  j  -  1,2 . M 

is  of  the  product  form, 

N 

q(K)  -  n  q(K  )  .  (29) 

i-1  1 


Now,  (29)  also  satisfies  the  constrained  problem  of  the  relative-entropy 
distribution  subject  to 


l  fj(K.)q(K)  -  <fj(Ki)>  j  -  1,2 . M 

K 


(30) 


and 


N 


l  f#(K)q(K)  -  <f  .(K)>  -  n  <f..(K,)>  l  -  M  +  1,M  +  2 . N  (31) 

Kx  x  i"lX11 


where  <f  (K  )>  is  an  element  of  the  set  of  marginal  constraints  given  by  (30). 

1  / 

The  relative-entropy  solution  subject  to  (30)-(31)  and  normalization  is  given  by 

M  N 

q(K)  -  p(K1)p(K2)  ...  p(KN)exp(-B0  -  \  6  f  -  \  fif)  . 

i-1  l  -M+l  *  * 


However,  by  uniqueness  of  the  relative-entropy  solution,  B  -  0,  M+1<£.<N 
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and  the  product  form  solution  follows. 

Conversely,  if  the  relative-entropy  approximation  is  of  the  product  form, 
and  every  joint  constraint  is  a  Joint  moment  satisfied  by  the  approximation, 
then  it  follows  that 

I  f(K)q(K)  -  <f(K)>  -  l  Kj  ...  K^q(K) 

K  1C 

-  I  K^qOtp  l  K^q(K2)  ...  £  K^qO^)  -  <kJ>  <K^>  ...  <k£>  . 

K1  K2  ’Si 

Hence,  each  joint  constraint  is  equal  to  the  product  of  marginal  constraints 
satisfied  by  the  marginal  relative  entropy  distributions.  Q.E.D. 
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